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Abstract 

r/3 . In this paper, we consider a scenario where one energy harvesting and one 

O 

battery operated sensor cooperatively transmit a common message to a dis- 



tant base station. The goal is to find the jointly optimal transmission (power 
allocation) policy which maximizes the total throughput for a given dead- 



> 

in 
oo 

. line. First, we address the case in which the storage capacity of the energy 

harvesting sensor is infinite. In this context, we identify the necessary con- 
CN ■ ditions for such optimal transmission policy. On their basis, we first show 

that the problem is convex. Then we go one step beyond and prove that 
^ ■ (i) the optimal power allocation for the energy harvesting sensor can be 

d \ computed independently; and (ii) it unequivocally determines (and allows 

to compute) that of the battery operated one. Finally, we generalize the 
analysis for the case of finite storage capacity. Performance is assessed by 
means of computer simulations. Particular attention is paid to the impact of 
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finite storage capacity and long-term battery degradation on the achievable 
throughput. 

Keywords: wireless sensor networks, energy harvesting, cooperative 
transmission 

1. Introduction 

Sensor nodes are typically powered by batteries that, quite often, are 
either costly, difficult or simply impossible to replace. Clearly, this lim- 
its network lifetime. Energy harvesting makes it possible to overcome this 
drawback by allowing sensors to harvest energy from e.g. solar, mechanical, 
or thermal sources. The harvested energy is typically stored in a device 
(e.g. battery, super capacitor) and then supplied for communication and/or 
processing tasks when needed. In recent years, many authors have analyzed 
low to optimally use such harvested energy. For single-sensor scenarios, in 
l| the authors derive the optimal transmission policy which minimizes the 
time needed to deliver all data packets to the destination subject to causal- 



ity constraints on energy and packet arrivals. In 
step beyond and, unlike 



the authors go one 



they consider finite storage capacity effects. In 
both cases, the energy harvesting instants and amounts of energy harvested 
are assumed to be known a priori. Ozel et al generalize the analysis to 
Rayleigh-fading channels and for the case in which the information on the 
' larvested energy and channel gains is either causally or non-causally known 
jj. Other works in the literature have addressed scenarios with multiple 
energy harvesting terminals. This includes studies for the multiple-access 
Q|, interference 5], relay 6] and broadcast [3,13] channels. 

Distributed beamforming techniques allow nodes in a Wireless Sensor 
Network (WSN) to act as a virtual antenna array in order to reach a distant 



Base Station (BS) or data sink. This, however, requires accurate frequency 
and phase synchronization over sensors. To that aim, one can resort to the 
iterative synchronization scheme with one-bit of feedback proposed in Q], 
or opportunistic sensor selection schemes [lo| . 

In this paper, we consider a scenario where one energy harvesting (EH) 
and one battery operated (BO) sensor cooperate to transmit (beamform) a 
common message to a distant base station. This differs from the scenarios 
in flflfl, where ioth sensors/— had e„e rgy harvest capahili- 
ties. Our goal is to find the jointly optimal power allocation strategy which 
maximizes the total throughput for a given deadline (e.g. the time by which 
batteries could be replaced) . This problem is equivalent to the one addressed 
e.g. in {l|, [J but here we consider the more general case with multiple trans- 
mitters. Besides, and unlike the Multiple-Access Channel (MAC) scenarios 

n 

in [J], sensors here attempt to convey a common message to the destina- 
tion. We also go one step beyond the distributed beamforming approaches 
in 0, [l(J where, implicitly, all sensors were assumed to be battery operated, 
and investigate the impact of energy harvesting constraints on performance. 
As in jjj], we initially assume that the energy harvesting sensor is equipped 
with a re-chargeable battery of infinite storage capacity. In this context, we 
identify the necessary conditions for the jointly optimal transmission policy. 
This leads to a problem that we show to be convex. Furthermore, and as an 
extension to our previous work in we prove that the optimal policy for 



the EH node can be computed independently from that of the BO one, and 



propose an algorithm to compute the latter from t 



a, 



ie former. Next, we gen- 



the storage capacity of 



eralize the analysis for a scenario in which, as in 
the EH sensor is finite. We also consider imperfections in the re-chargeable 
battery of the EH sensor. More specifically, we focus on the impact of long- 



term capacity degradation, as opposed to the (short-time) battery leakage 
effects addressed in [3] 

2. Signal and Communication Model 

Two sensors cooperate to transmit a common message m(t) to a distant 
base station. The received signal thus reads 

r(t) = m(t) Wi(t)e jMt) ^J + n(t) (1) 

where the common message is given by m(t) = Y2i x l9(t ~ IT S ), with {x{\ 
standing for a sequence of zero-mean complex Gaussian symbols with unit 
variance (T s is the symbol period) and g(t) denoting the impulse response 
of a bandlimited pulse (unit bandwidth); iOj(t) = y/piitye^*^ denotes the 
time- varying complex transmit weights in polar notation (to be designed); 
giV'iW stands for the phase shift of the (Gaussian) sensor-to-base station 
channels; and n(t) is zero-mean complex additive white Gaussian noise with 
unit variance (i.e. n(t) ~ CM(0, 1)). In the sequel, we assume that by prop- 
erly designing <f>i(t) the channel phase and, where relevant, oscillator offsets 
can be ideally pre-compensated [{]]. Frequency and time synchronization is 
assumed, as well. Hence, the sensor network behaves as a virtual antenna 
array capable of beamforming the message to the base station. Without 
loss of generality, we let the first sensor be the one with energy harvesting 
capabilities, and the second to be battery operated. Consequently, we here- 
inafter denote by p (t) = pi(t) and p B (t) = P2{t) the transmit power at the 
energy harvesting and battery operated sensors, respectively. Bearing all 
this in mind, the instantaneous received power at the base station is given 



by PBpit) — (\/p H (t) + \/p B {t)) 2 ■ The total throughput for a given deadline 
T then reads 
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G T (p H (t),p B (t))= [ log (1+pBf (*))<**■ ( 2 ) 

J 

Our goal is to find the jointly optimal transmission (power allocation) poli- 
cies p H (t) and p B (t) such that Gt is maximized subject to the causality 
constraints imposed by the energy harvesting process, nameljl], 

e H (t) < E H {t) 4 E l (3) 

k:s k <t 

e B {t)<E B {t) 4 £ 2 , (4) 

for < t < T, where e^(t) = Jq p h {t)cLt and e B (t) = f^p B (r)dT denote the 
energy consumption (EC) curves; and E H (t), E B {t) stand for the cumulative 
energy harvesting (cEH) constraints (see Fig. [I]). In the above expression, 
E l k accounts for the amount of energy harvested by sensor i in the k th event 
{k = ... iV — 1). We define event Sk as the time instant in which some 
energy is harvested by any of the sensors in the network {E l k = for the 
sensor not harvesting any energy in that event). Both the events and the 
amounts of energy harvested E % k are assumed to be known a priori. Further, 
we impose Eq > for all i (sensors) so that collaborative transmission can 
start immediately, that is, from t = 0. For battery operated sensors, we have 
E\. = for k > and, thus, the cumulative energy harvesting function is 
constant for the whole period. For the EH sensor, on the contrary, it is given 
by a staircase function. Finally, we define epoch as the time elapsed between 
two consecutive events s^ and s^-i- Its duration is given by = s^ — Sk-i 
for k = 1 . . . N — 1 and, likewise, we define — T — Sjv-i- A given 
transmission policy is said to be feasible (yet, perhaps, not optimal) if, as 

1 For scenarios where the storage capacity of the EH sensor is finite, additional con- 
straints must be introduced (see Section [5j . 
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imposed by ([3|) and (j3]), the energy consumption curves lie below cumulative 
energy harvesting ones at all times (or occasionally hit them). 

3. Necessary conditions for the optimality of the transmission pol- 
icy 

The following lemmas give the necessary optimality conditions under the 
assumption of infinite storage capacity for the EH sensor. Moreover, the 
insights gained into the problem structure allow us to compute the jointly 
optimal transmission policies in Section [4l Unless otherwise stated, the 
lemmas hold for both the energy harvesting and battery operated sensors. 

Lemma 1. The transmit power in each sensor remains constant between 
consecutive events. 

In other words, the power/rate in each sensor only potentially changes 
when new energy arrives to any of them The proof of this lemma, which 
is based on Jensen's and Cauchy's inequalities, can be found in Appendix A. 
This lemma implies that p (t) = p B ,p B (t) = p B for S£,._i < t < s^. That is, 
the power allocation curves p H (t) and p B (t) are necessarily staircase functions 
and, hence, the energy consumption curves e H (t) and e B (t) are piecewise 
linear. This observation allows us to pose the original problem ([5]) in a 
convex optimization framework in which a numerical (or analytical) solution 
is easier to find. This will be accomplished in the next section. 

Lemma 2. All the harvested/stored energy must be consumed by the given 
deadline T. 

2 In our scenario, only one sensor harvests energy. Still, this lemma holds for a more 
general case with multiple energy harvesters. 



I. 



This means that, necessarily, the cumulative energy harvesting curves 
reach the energy consumption constraints at time instant T. 

Proof. Lemma [2] can be easily proved by contradiction. Assume that the 
optimal transmission policy does not fulfill such condition. We could think 
of a feasible policy such that (i) the set of curves e H (t) and e B {t) differ from 
the optimal ones in the last epoch only, namely, for t G [s;v-i . . . T); and 
(ii) it verifies e (T) = E H (T) and e (T) = E B (T). Being piecewise linear 
(and continuous), these curves would necessarily lie above the optimal ones 
at least in part of such last epoch, this resulting in a higher received power 
and throughput. This contradicts the optimality of the original transmission 
policy. □ 

Lemma 3. If feasible, a transmission policy with constant transmit power in 
each sensor between any two (i.e. not necessarily consecutive) events turns 
out to be optimal for the period of time elapsed between these two events. 

This lemma goes one step beyond and states that Lemma [1] also holds 
for non- consecutive events, as long as a constant transmit power policy in 
both sensors is feasiblqfl for this period. This follows directly from the proof 
of Lemma [T] but, since one or more energy harvests might take place in 
between the initial and final events, feasibility needs to be ensured (clearly, 
this is not needed in Lemma [T}. 

Lemma 4. The transmit powers for an energy harvesting sensor with infi- 
nite storage capacity are monotonically increasing, i.e. p^ < p^ < • • • < Pjj 



3 In our setting, this can only be constrained by the cEH curve of the EH sensor. 
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Proof. This property follows from the fact that E (t) is a staircase func- 
tion. Assume that the power allocation policy before s^-i and after s^+i 
is optimal. As shown in Fig. [21 the optimal EC curve verifies e H (si~+i) £ 
(e H (s k -i), E H (s k+ i)] ■ For e H (s k+ i) G (e H (s k -i), e H ] , we know from Lemma 
[3] that a constant power allocation for the energy harvesting and battery op- 
erated sensors turns out to be optimal for [s k -i, Sfc+i) (and, hence, for [0, T]). 
In particular, this implies that p k+1 = p k ■ For e H (s k +i) € (ej?, E H (sk+i)] , 
on the contrary, the fact that e H (t) is continuous and piecewise linear can 
only be ensured if (and only if) p k+1 > p k ■ By repeatedly applying this 
reasoning to all consecutive epoch pairs the proof follows. As for the rela- 
tionship between p k+i and p^ , nothing can be said yet. Still, the fact that 
is a constant function does not impose any additional restrictions to 
the power allocation policy of the BO sensor in [s^-i, Sfc+i]. □ 

Lemma 5. Transmit powers are strictly positive. 

Proof. Again, this can be proved by contradiction. Assume that the power 
allocation policy before s k -i and after s^+i is optimal. Assume that, as 
shown in Fig. [SJ the optimal policy for the Sfc+i) period verifies pf? = 

and p k+ i > 0. One could think of a new (and feasible) transmission policy 
given by pf = .^f^ and pf = ■ fc ^_ i for t E [a fc _i, a fc ); and = 
along with p k+1 = P k+ i for t G [sk,s k+ i). From the proou of Lemma EJ 
we know that the new policy achieves higher throughput than the original 
one in [sk-i,Sk) and, thus, in [s^-i, s^+i) too. Yet not optimal (since this 
new policy e.g. contradicts Lemma [1]), this proves that the original policy 

4 Although Lemma[3]holds for EH events, its proof has a broader scope and encompasses 
any time instant, such as s' k . See Appendix A. 
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given by p B = and p^+i > was not optimal either. Along the same lines, 
one can easily find a new feasible policy achieving higher throughput than 
pf = and pf + 1 > for the battery operated sensor. □ 

4. Computation of the optimal transmission policy 

Lemma Q] allows us to re- write the original optimization problem given 
by the score function ([2]) and the causality constraints ([3j) and (j4]) as follows 
(to recall, our focus here is on scenarios where the storage capacity of the 
EH sensor is infinite): 



max T fc log ( 1 + (\/pf + \ pf ) 2 J 



{P- 

s.t.: 

n n— 1 



(5) 



J>pf <^ = J>^ forn=l...iV (6) 



k=l k=0 
n 



£> fe p£ < E* = El forn=l...iV (7) 

fe=i 

pf > for k = 1 . . . N (8) 

pf > for k = 1 . . . N (9) 

where we have defined E B — E H (t) for t € [s n -i, s n ), and where the last two 
strict inequalities follow from Lemma [SJ The problem is convex since all the 
constraints are afline and linear, and the objective function is concave, as we 
will prove next. To that aim, we observe that the /c th term in the summation 
exclusively depends on the corresponding optimization variables pf and pf 
(i.e. no cross-term variables). Hence, it suffices to show that an arbitrary 
term in the summation, namely, G±(p H ,p B ) = rlog(l + (\/ p H + \fp B ~) 2 ) 
is concave (indices have been omitted for brevity). Or, alternatively, that 
G2(p H ,p B ) = —G\(p H ,p B ) is convex. The latter can be verified by realizing 



that, for p > and p > 0, its 2 x 2 Hessian is positive definite, namely, 
\7 2 G2(p H ,p B ) >~ 0. Since the optimization problem is strictly convex, its 
unique solution can at least be found numerically (e.g. by resorting to in- 
terior point methods). However, this task is computationally intensive, in 
particular when the number of energy harvesting events N is large. This mo- 
tivates the following lemma and two theorems from which a semi-analytical 
and less computationally intensive solution to the optimization problem can 
be obtained. 

Lemma 6. The jointly optimal power allocation policy is such that, when- 
ever the transmit power changes, the energy consumed by the energy har- 
vesting sensor up to that time instant, equals the energy harvested by such 
sensor up to that instant (i.e, the stored energy is zero). 

The proof of this Lemma is based on the Karush-Kuhn- Tucker (K.K.T.) 
conditions associated to the (joint) optimization problem ([5|)-([9]). Details 
can be found in Appendix B. 

The next theorems state the main result of this paper since they allow 
to effectively compute the optimal transmissions policies for the EH and BO 
sensors, respectively. 

Theorem 1. The optimal transmission policy for the energy harvesting sen- 
sor, {pj?}k=i> can be computed independently from that of the battery op- 
erated one. The associated energy consumption curve turns out to be the 
shortest string starting in t = 0, ending in t = T, and lying below the cu- 
mulative energy harvesting curve. 

Proof. As we will prove next, Lemmas [T] to [6] unequivocally determine the 
optimal transmission policy for the EH sensor. First note that, in order to 
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satisfy the energy causality constraint, the corresponding EC curve must 
lie below the cEH curve. Prom Lemma [2J it follows that the EC curve 
reaches the cEH curve at t = T. Besides, from Lemmas [T] and we know 
that the transmit power only potentially changes at the energy harvesting 
events. Consequently, the optimal EC curve must be linear between them 
(i.e. piecewise linear). Moreover, Lemma [6] dictates that, whenever the 
transmit power (slope) changes at an energy harvesting event, the EC curve 
hits the cEH curve. Based on these facts, we conclude that the first linear 
part of EC curve must connect the origin with some corner point on the 
cEH curve (see Fig H]) . Because of Lemma HI we must choose the one with 
the minimal slope, since otherwise the constraint on energy causality (point 
c) or monotonicity property of Lemma [5] (point a) would not be satisfied. 
Clearly, in Fig U] this corresponds to point b. Once this point is identified, 
the algorithm can be iteratively applied until we find the optimal policy 
until deadline T. As a result, the EC curve is given by the shortest string 
below the cEH curve. It must be noted that this algorithm is equivalent to 
the one presented in However, the interesting points are that (i) we have 
proved that it continues to be optimal in a scenario where two sensors, one 
of them battery operated, collaborate to send the message (vs. one sensor 
in [lj]); and that (ii) no information on the BO sensor (i.e. its optimal EC 
curve) is needed to determine it. □ 

Theorem 2. Upon finding the optimal transmission policy for the energy 
harvesting sensor, the optimal transmission policy for the battery operated 
one, {p?}u = i, can be computed with the iterative procedure given by Algo- 
rithm [IJ 

Proof. This algorithm stems from the proof of Lemma in Appendix B (see 
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Algorithm 1 Optimal policy for the battery operated sensor 
1: choose some small 5 > > Step for searching 

2: m := > Iteration index 

3: E^ := Eq > Energy stored in the battery 

4: repeat 

5: m := m + 1 

6: for all k = 1 • • • N do 

7: B fe , m := m<5, 

8: solve if = gfe£i^^g^^g^ for ^ 

/ yl \ 2 



10: end for 

12: until E^ m = 

13: pf <- p£ m VA; 



Remark). The real- valued variables and (or their counterparts for 
iteration m, namely, A\~^ m and Bk m ) are linear functions of the Lagrange 
multipliers associated to the constraints (jSJ) and ((TJ), respectively. Therefore, 
the equation in Step 8 provides a connection between the primal and dual 
solutions of the problem. Since p^ is already known from Theorem [TJ for 
each value of B}. m to be tested (from Appendix B we know that all the 
BkS are identical and equal to the largest Lagrange multiplier associated 
to ([TJ), which is enforced in Step 7), the associated Ak t m can be found by 
solving the corresponding third order equation (a single real-valued root 
exists). From p? , Af. m , and B^ m , an estimate of the optimal transmission 
policy for the battery operated sensor for the current iteration, namely, 
{Pkm}k=v follows in Step 9. If the total energy consumed until time instant 
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T by the battery operated sensor, computed in Step 9, equals the energy 
(initially) stored in it, E^ , the iterative algorithm stops. The stopping 
condition not only ensures that Lemma [2] is fulfilled but also, it implies that 
the whole transmission policy for the battery operated sensor {p^}^ = i is 
feasible. In summary, we have found the optimal transmission policy for 
the BO sensor by (i) conducting a grid search over one variable of the dual 
solution ; and (ii) checking in each iteration whether the unknown part 
of the primal solution results from the algorithm is feasible. Clearly, the 
choice of 5 leads to a number of trade-offs in terms of accuracy and number 
of iterations needed. □ 

As for algorithmic convergence, one can easily prove that each element 
in the set of transmit powers {p^}^ = i is a monotonically decreasing function 
in vn (the only non-zero element in the dual solution, see Appendix B for 
details). Likewise, Ej, is a monotonically decreasing function in z/jy as well. 
In other words, there exists a one-to-one mapping function between the 
primal and dual solutions. This turns out to be a sufficient condition for the 
algorithm to converge, as long as a sufficiently small step size 5 is used for 
the grid search over some range of values. 

Finally, Fig. [5] depicts the optimal transmission policies corresponding 
to the EH and BO sensors for a specific realization of the energy arrivals. 
Clearly, (i) it satisfies all the lemmas and theorems; (ii) Lemma U] on the 
monotonicity of the optimal power allocation does not hold for the BO 
sensor; and (iii) in order to collaboratively transmit data, the BO sensor 
must adopt an optimal transmission policy which is different from that of 
the single-sensor scenario, that is, constant transmit power within [0 . . . T\. 
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4-1. Computational complexity analysis 

To recall, the computation of the optimal transmission policy for the EH 
sensor entails the determination of a number of piece-wise linear functions 
with minimal slope which connect a subset of the N corner points on the 



cEH curve (see proof of Theorem [T] and 



In the worst case, the total 



] curve equal Jf| N 

point (actually, the origin), the total number of slopes to be checked equals 
N, that is, as many as the number of corner points up to t = T. For the 
second corner point, the total number of slopes equals N — 1. The total 
number of operations is, thus, N + (N — 1) + ... 1 = jV '(^~ 1 ) _ Hence, 
the complexity associated to the computation of the optimal transmission 
policy for the EH sensor is 0(N 2 ). As for the BO sensor, each iteration of 
Algorithm [JJ entails the computation of N transmit powers (Steps 6 to 10). 
When a bi-section scheme is adopted (rather than the grid search we actually 
used in Algorithm [I} , the total number of iterations needed is on the order 
of log(-) [13], where e denotes the constraints prescribed tolerance. Hence, 
the complexity associated to the computations of the optimal transmission 
policy for the BO sensor is C?(iV log(^)). In conclusion, the computational 
complexity of the proposed scheme is dominated by that of the algorithm 
presented in l(] and it reads 0(N 2 ). 

5. Generalization to scenarios with finite storage capacity 

Unlike in previous sections, here we assume that the energy storage ca- 
pacity of the EH sensor, E ma ^, is finite. If, in the k-th event, the energy 
harvested by the EH sensor El exceeds the remaining storage capacity at 



3 The actual number depends on the specific realization of energy arrivals. 
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that time instant, a battery overflow occurs. That is, its re-chargeable bat- 
tery gets fully charged and the excess harvested energy is simply discarded. 
In Appendix C, we prove that any transmission policy allowing battery over- 
flows to occur is strictly suboptimal. Assuming that|£| El < E miix V/c, those 
suboptimal solutions can be removed from the feasible set by imposing that 

e H (t) > E s (t) 4 El - £ max (10) 

for < t < T, where E s (t) denotes the cumulative energy storage (cES) 
constraint. One can easily verify that Lemmas [T]|3] and Lemma [5] still hold 
for the case of finite storage capacity. On the contrary, Lemma H] does not, 
as we will discuss in the proof of Lemma Since, in particular, Lemma 
[T] does hold, the optimization problem can be posed again by the set of 
equations given by ©-Q in Section [H along with the additional constraint 
(fit)]) , namely, 

n n 

Y^TkPk >E% = Y,E\- £ m ax for n = 1 . . . N. (11) 

fe=l fc=0 

A graphical representation for this additional constraint can be found in 
Fig. [6l Clearly, a transmit policy is now feasible when the corresponding 
EC curve lies inside the tunnel defined by the cEH and cES curves. The 
additional constraint (lllh is affine and therefore the optimization problem 
continues to be convex. 

The next Lemma is an extension of Lemma [6] for the case of finite storage 
capacity: 

Lemma 7. The jointly optimal power allocation policy when the storage ca- 
pacity of the EH sensor is finite is such that, whenever the transmit power 



^Otherwise, part of the energy in each arrival will be unavoidably wasted. 
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changes, its re-chargeable battery is either fully charged or completely de- 
pleted. 

Proof. The proof of this Lemma is based again on the Karush-Kuhn- Tucker 
(K.K.T.) conditions associated to the new optimization problem and it can 
be found in Appendix D. This Lemma can also be regarded as an extension 
of Lemma 4 in [2|] for scenarios with multiple sensor nodes forming a virtual 
array. In essence, Lemma [7] states that changes in the slope of the EC curve 
can only occur when it hits either the cEH curve (depleted battery) or the 
cES curve (fully charged). Intuitively, this is the reason why Lemma H] (on 
the monotonically increasing behavior of transmit powers for the EH sensor) 
does not hold anymore in scenarios with finite energy storage capacity. This 
extent is illustrated in Fig. [7J □ 

In the same vein of Theorem [IJ one can easily verify that Lemmas Q3- 
EJ [5l and [7] unequivocally determine the optimal transmission policy for 
the EH harvesting sensor with finite storage capacity. Since those Lemmas 
are equivalent to the ones presented in [2| for the single sensor case, the 
(jointly) optimal transmission policy for the EH sensor here can be again 
independently computed on the basis of algorithm Al proposed therein. 
Interestingly, the optimal EC curve turns out to be the shortest feasible 
string which, now, lies inside the tunnel given by the cEH and cES curves. 
Besides, the equation in Step 8 of Algorithm Q] in our paper continues to 
provide a connection between the primal and dual solutions of the problem 
with finite storage capacity (yet with a different definition of variables A^, 
see Appendix D). Since no additional constraints apply to the BO sensor, 
its optimal transmission policy can be computed from that of the EH one 
with Algorithm 1. 
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6. Computer Simulation Results 

In this section, we assess the performance of the proposed power al- 
location algorithm in a scenario where solar energy is harvested from the 
environment. The energy storage system in the EH sensor comprises (i) a su- 
percapacitor [l4]; and (ii) a re-chargeable Lithium-Ion battery. Upon being 
harvested, the energy is temporarily stored in the supercapacitor. When it 
is fully charged, the energy is transferred to the battery in a bursj^. Clearly, 
this validates the event-based model of the energy harvesting process pre- 
sented in Section [2J For such devices, the amount of energy harvested in 
each event is constant and it equals the maximum energy that can be stored 
in the supercapacitor. Since solar irradiation levels change over time (e.g. 
from dawn to noon, from winter to summer), so does the average number of 
energy arrivals (events). Consequently, the stochastic process that models 
energy arrivals is non-stationary. In the sequel, we adopt a Poisson process 
with time- varying mean given by Ag(t). From the solar irradiation data in 
[if]], the mean arrival rate from 5 A.M. to 12 P.M. (i.e. dawn to noon, with 
T = 7 h) can be fitted by the following exponential function: 

X E (t) = P E ,c, T e ct (12) 

where parameter c models the variability of the energy harvested over time 
(i.e. the rate of energy transfers from the capacitor to the battery); and 
Pe,c,t is a constant depending on the total amount of energy harvested E, 
parameter c, and the total harvesting time T. For the solar irradiation 
data in Q, it yields p E , c ,T = 3.899 • 10~ 2 and c = 6 • 10~ 5 . Figure E 



7 Pulse charging is beneficial for Lithium-Ion batteries in terms of improved discharge 
capacity and longer life cycles [if]]. 
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shows a number of cumulative energy harvesting curves for different values 
of parameter c: the higher c, the higher the energy variability, i.e. the 
steeper the curves by the end of the observation interval. Hereinafter, we let 
E T = T,k=o E l and E T = E l denote the total energy harvested by /stored 
in the EH and BO sensors, respectively; whereas Et = E^ + E^ accounts 
for the total energy in the system. Further, we define Re = EM /ES as the 
ratio between the total energy in the BO and EH sensors, that is, for large 
Re, the battery operated sensor dominates. In all plots, we have T = 7 h 
(from 5 A.M. to noon). 

6.1. Infinite Energy Storage Capacity 

Initially, we assume that the storage capacity of the aforementioned 
Lithium-ion battery is infinite. In Fig. [9j we depict the throughput of 
the virtual array with the jointly optimal transmission policies defined by 
Theorem [1] and Theorem [2] for the EH and BO sensors, respectively. The 
amount of energy harvested by/stored in the EH and BO sensors is identical 
{Re = 1) and results are shown as a function of the total energy Et in the 
system. As benchmarks, we consider (i) a system with only one EH sensor, 
the cEH curve of which is given by the point-wise sum of the cEH curves for 
the EH and BO sensors in the virtual array (curve labeled with "1H, sum of 
cEHs"); and (ii) a two-sensor virtual array in which the transmission policies 
or the EH and BO sensors are optimized individually for each sensor as in 
l|, which is suboptimal for a virtual array ("1H+1B, suboptimal"). For (ii), 
the optimal policy for the BO sensor consists in a constant transmit power 
for t = . . . T. Unsurprisingly, for systems with multiple transmitters the 
beamforming gain translates into substantially higher throughputs. Besides, 
some additional throughput gain results from the joint optimization of the 
transmission policies for the EH and BO sensors, that is, by forcing the BO 
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sensor to adapt to the changes in transmit power in the EH sensor. 

Fig. [TU] provides further insights on the throughput gains stemming from 
the joint optimization of transmission policies over sensors. More precisely, 
we depict the throughput gain by ratio Rq = Gx,opt / GT,subopt as a function 
of the total system energy. Interestingly, the highest gain is attained when 
the total energy harvested by the EH sensor equals that stored in the BO 
one, that is, for Re = 1. Yet in a totally different context, this is consistent 
with [n| where the authors conclude that, in order to maximize the beam- 
forming gain, the received signal levels from the opportunistically selected 
sensors must be comparable. Conversely, when either the EH or the BO 
sensors dominate (Re <C 1 or Re S> 1, respectively) the gain from a joint 
optimization becomes marginal (Rq —> 1) since the signal received from the 
other sensor is weak. We also observe that, in the case of unbalanced energy 
levels, throughput gains are lower when the BO sensor dominates. This is 
motivated by the fact that when <C Ej, the policy for the EH sensor 
has very little impact in the definition of the (jointly) optimal policy for the 
BO one. In other words, the energy consumption curves for the BO sensor 
with and without joint optimization are similar and, hence, the throughput 
gain approaches 1. It is also clear that throughput gains become negligible 
when Et increases (i.e. in the high SNR regime). Let a = £ ; r,high/-E'r,iow 
denote the ratio of total system energies in the high and low SNR regimes. 
Since the total received power pBf{t) scales with a, from the score function 
in (p)} and for large -Er.high we can write, 

d tw \ _ Nlog(a) + G T ,opt(ET ,low ) 
Ariog(a) + G r ,subo P t(-Er,iow) 

Clearly, for large a the impact of the specific transmission policies (optimal/ 

suboptimal) diminishes. In other words, joint optimization of transmission 
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policies is more relevant in the low-SNR regime. 

Next, Fig. [TT] illustrates the impact of the variability of energy arrivals 
in the throughput gain. Clearly, the higher the variability (i.e. for higher 
values of parameter c), the higher the gain: Rq = 1.2 (or +20% gain) for 
c = 3 • 10 -4 and Er = 10 J. On the contrary, if the average number of 
arrivals does not vary (increase) substantially in the observation interval, 
the gain stemming from a joint optimization of both transmission policies is 
marginal (Rg ~ 1). In conclusion, rapid variations of solar irradiation levels 
from dawn to noon (e.g. in high latitude locations, winter time) make joint 
optimization of transmission policies advisable. 

6.2. Finite Energy Storage Capacity 

Unlike in the previous subsection, here we realistically assume that the 
energy storage capacity for the EH sensor is finite. 

Figure [12] depicts the total loss in throughput with respect to the case of 
infinite storage capacity by throughpout ratio Lq. Interestingly, as long as 
the maximum storage capacity is greater than the energy harvested in each 
arrival, the throughput loss is barely noticeable (the throughput ratio equals 
1). In other words, the changes in the optimal transmission policy resulting 
from the introduction of the additional constraint (jlip , which avoids battery 
overflows, have a rather marginal impact on the achievable throughput. This 
is excellent news since, typically, storage capacity is well above individual 
harvested energy levels. On the contrary, throughput performance rapidly 
degrades for smaller storage capacities. This stems from the fact that now 
part of the energy in each arrival is unavoidably wasted in battery overflows. 
As a result, the total amount of energy stored with respect to the case of 
infinite capacity decreases, and so does the resulting throughput. 



20 



Next, we analyze the impact of battery degradation in the EH sensor 
on system performance. Our focus is on impairments due to long-term 
battery degradation due to e.g. aging. Accordingly, its storage capacity 
is assumed to take a constant value for the whole transmission period (i.e. 
no battery leakage between arrivals). The nominal storage capacity E™™, 
on which basis the optimal transmission policies for the EH and BO are 
computed, is assumed to be known. On the contrary, the actual capacity 
-^max al — ^maxi which enables data transmission, is unknown. The fact that 
the actual capacity is lower that its nominal value may result into battery 
overflows and early battery depletion (see Fig. [T3|h both having a negative 
impact on the achievable throughput. Despite of the introduction of the 
additional constraint (jlip . now there is a risk to waste part of the energy 
arrivals in battery overflows since the remaining battery capacity is smaller 
than expected. As an example, for the particular realization in Fig. [T3l the 
total energy actually harvested within . . . T amounts to 13.375 J instead 
of 16 J. Likewise, the fact that the actual energy stored in the battery is 
lower than expected might lead to early battery depletions. This forces data 
transmission for the EH sensor to be suspended until the next energy arrival. 
Consequently, the beamforming gain vanishes for this period of time. 

In Fig. [Til we investigate the impact of battery overflows and early de- 
pletions on throughput. More specifically, we depict the throughput ratio 
Lq = G , r,actuai/G ! T',nom as a function of the ratio between actual and nominal 
battery capacities, namely, Rc = ^max /-^max- Unsurprisingly, throughput 
degradation is particulary severe and faster for smaller values of nominal ca- 
pacity (i.e. for -Emax — 0-05 J). In this case, the amount of energy in each 
arrival (El = 2.21 • 10~ 2 J) is comparable to the nominal capacity. Conse- 
quently, many battery overflows and early depletions occur. Furthermore, 
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for Rq = 0.1, the actual battery capacity amounts to i?^ = 5 • 10 
which is below £7i. Hence, every energy arrival causes a battery overflow 
which results into a throughput loss of 60%. It is also worth noting that for 
large nominal battery capacities (-E^ax = 1 J) and higher values of capacity 
degradation [Rq = 0.1) there is also a noticeable throughput loss (some 
10%). Even though the actual battery capacity (S^ 1 = 0.1 J) is well 
above E^,, the mismatch between nominal and actual capacities results into 
some battery overflows and early depletions too. 

7. Conclusions 

In this paper, we have derived the jointly optimal transmission policy 
which allows an energy harvesting plus a battery operated sensor node to act 
as a virtual antenna array to maximize throughput for a given deadline. The 
necessary conditions for optimality that we have identified, both for scenar- 
ios with infinite and finite energy storage capacity in the energy harvesting 
sensor, allowed us to prove that the optimal transmission policy for the 
energy harvesting sensor can be computed independently from that of the 
battery operated one according to the procedure described in [J and Q, re- 
spectively. Interestingly enough, we have proved that such policies continue 
to be optimal for our too-sensor (vs. single-sensor) scenario. Moreover, we 
have shown that the optimal transmission policy for the battery operated 
sensor is unequivocally determined and can be iteratively computed from 
that of the energy harvesting one. The resulting policy is, in general, dif- 
ferent from that of battery operated sensors in single-sensor scenarios (i.e. 
constant transmit power). The performance of the jointly optimal transmis- 
sion policy has been assessed by means of computer simulations in a realistic 
scenario where solar energy is harvested from the environment. Computer 
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simulation results revealed that, in scenarios with infinite storage capacity 
in the energy harvesting sensor, the joint optimization of transmit policies 
in combination with beamforming yields substantial throughput gains. The 
highest gain is attained when the total energy in the energy harvesting and 
battery operated sensors are identical. However, the gain becomes negli- 
gible in high-SNR scenarios where large amounts of energy are harvested 
by/stored in sensors. In the case of unbalanced energy levels, throughput 
gains are lower when the BO sensor dominates. Besides, we have found that 
throughput gain is larger when solar irradiation levels vary rapidly. We have 
also learnt that throughput losses stemming from finite storage capacity are 
only substantial when battery capacity is smaller than the amount of energy 
in each arrival. Finally, we have observed that a long-term degradation of 
battery capacity may result into battery overflows and early battery deple- 
tions. The associated throughput loss is particulary severe for smaller values 
of the nominal storage capacity. Still, the impact of the mismatch between 
nominal and actual capacities can also be noticeable for larger values. 
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Appendix A. Proof of Lemma [TJ 

Assume that the optimal policy before s^-i and after is optimal. 
The total throughput in the fc-th epoch is given by G Tk = f^ k log(l + 
PBF(t))dt where, to recall, we defined pBF\t) = (1/ p H (t) + \/p B {t)) 2 as 
the instantaneous power received at the base station from the two sensors. 
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Besides, let AesF = /** PBF(t)dt denote the total received energy in the 
A:-th epoch of duration = — Sfc-i- Prom Jensen's inequality [17j| [Sec. 
7.2.5], we have that the following inequality: 

j b a g{f{t))h(t)dt < h- b a f(t)h(t)dt \ 

IlKt)dt ~ 9 \ ! h a h{t)dt ) 

holds as long as g(-) is a concave function, f(t) is such that a < f(t) < /3, 
and h(t) > 0. Letting #(p) = log(l +p), f(t) = pBFif) and h(t) = 1 yields 

G Tk = f k log(l + p BF (t))dt 

/ Ae BF \ 
= r fe log 1 + 



This last inequality evidences that for a given energy Ae^i?, the optimal 
power allocation policies for the k-th. epoch must be such that the instanta- 
neous received power at the BS is constant and equal to PBF{t) = AeBF/ T k- 
In order to determine the optimal transmission policy for each sensor, we 
resort to Cauchy's inequality [l^] [Sec. 7.2.5] to learn that 

f r PBF(t)dt) < ( r p H (t)dt) + 1 r p^dt) 

or, equivalently (see Fig. [T5j) . 

f k PBF{t)dt < (VKe 11 + VAe^Y . (A.2) 

By replacing (|A.2|) into (|A. 1[) . we finally get: 




G Tfe <r fc log 1+ \\— + \ /— . (A.3) 



In other words, the individual power allocation policies that maximize the 
throughput in the k-th epoch consist in using a constant transmit power 
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given by p (t) = Ae H /t^ and p B (t) = Ae B jr^ for the EH and BO sensors, 
respectively. This concludes the proof. 

Appendix B. Proof of Lemma [6] 

The Lagrangian of the optimization problem Q is given by 

A=-Er fc log(l + (^+^f) 2 ) 
fc=l v ' 

N / n \ N / n \ 

n=l \fc=l / n=l \fc=l / 

N N 

fe=l k=l 

and, hence, the corresponding K.K.T. conditions read 

^,^ = (B.2 

n 

J^nPk <E%forn = l...N (B.3 
fe=i 

n 

5> fc j>f forn = l...iV (B.4 

k=l 

Pk,Pk>V (B.5 
K,v n ,fa,tik>Q (B.6 

(E 7 *^ - -^ =0forn= 1...JV (B.7 
(E T ^f "^n J =0forn= 1...AT (B.8 



\fc=i / 

-/} fc pf = for & = 1 . . . N (B.9 

-Ifcpf = 0forfc = l...iV. (B.10 
where the partial derivatives in (|B.2p are can be expressed as 
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^ ^(l + (^f + ^f) 2 

a~B = ~ r fc ; / 7= ; \ + T k >^ M n 



n=k 
N 



Pk i 1 + (\ Pk + \ Pk) 2 



n=k 



Prom equation f|B.2[) and by introducing the change of variables A k 

* N .\ - h. and B k - ^ N r, _ k 
k-\h epoch, p^ and p^ , yield 



X^n=fc ^« ~~ ^ an d ^fe = Sn=fc v n ~ f^i the optimal transmit powers in 



fc i fc (ifc + B fc ) 2 

_ Ak(Ak + Bk — AkBk) (~R -\9) 

Pk ~ B k (A k + B k y 

Since, as stated in Lemma [5] and equation (|B.5|) above, p k ,Pk / 0, the 
complementary slackness conditions (|B.9j) and fjB.lOj) . force the correspond- 
ing Langrangian multipliers to vanish, i.e. fik,£,k = 0. When transmit 
power changes, we have p^ < p k+ x- From (|B.11|) and the Remark be- 
low, this can only hold if Ak ^ A k +i or, equivalently, if Xk ^ (to recall, 
\Xk = 0). From the complementary slackness condition in (|B.7p . we have 
that Xk / => Yli=i T iPf ~ E k — 0. That is, the energy consumed by 
the energy harvesting sensor up to s k , equals the energy harvested by such 
sensor up to that instant (see Fig. [IB]) . This concludes the proof. 

Remark: From Lemma [21 we know that Ylk=l T kPk = -^0- Since, in 
addition ^ this yields ££ =1 r fc pf - Eg < for all n = 1 . . . N - 1. 
From the complementary slackness condition of ()B.8[) . we conclude that, 
necessarily, v k = for k = 1 . . . N — 1. This, along with the fact that jif. = 
for all k, implies that B k = B^ = u^, Vfe, that is, all BkS are identical. This 
property is a cornerstone of Algorithm [1] since it turns an TV-dimensional 
exhaustive search into a single-dimensional one. 
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Appendix C. Transmission policies with battery overflows are sub- 



Here we show that any transmission policy resulting into battery over- 
flows in the EH sensor is strictly suboptimal. We will prove this by con- 
tradiction. Assume that a transmission policy with battery overflow at Sk 
only (Fig. QH left) is optimal. Let U H = {pf , . . . ,pf_ ± ,pf ,pf +1 , . . . ,p§ } 
and Tl B = {pf , . . . ,p B ^i,pf >Pk+n ■ ■ ■ iPn} denote the corresponding opti- 
mal transmission policies for the EH and BO sensors, respectively. We can 
think of an alternative (and feasible) transmission policy II = {II , IT 5 } 
such that, on the one hand, 11^ = {pf ' , . . . , P^-ii Pk ■> Pk+v Pn} ana -> 011 t ne 
other, tl B = U B . That is, the new policy only differs from the optimal one 
in the power allocated to the EH sensor in the k-th. epoch. By properly ad- 
justing pH , the battery overflow at Sk can be avoided (Fig. [T7]right). Since, 
clearly, pf > pf , the throughput in the k-th. epoch is higher, this resulting 
into a higher total throughput in [0 . . . T] . This contradicts the claim that 
the original policy n = {n^,n B } is optimal and concludes the proof. 

Appendix D. Proof of Lemma 

The Lagrangian C2 of the new optimization problem with finite battery 
capacity constraints is given by 



optimal 




fc=i 



N / n \ N 



n 



Y,r k pf~E S n 



n 



) 



n=l \fc=l / n=l 



k=l 




(D.l) 



The new K.K.T. conditions thus read 
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dC 2 dc 2 _ 

n 

£> fc pf <E%forn = l...N 

k=l 
n 



Y,nPk >E% forn = l...iV 
fe=i 

n 

^nPk < E%i<xn = 1...N 



k=l 



Pk,Pk > o 



An; TTnj ^n> A*fc> 

An ( f>fcif - E* J = forn = 1 . . . N 



D.2 
D.3 
D.4 

D.5 

D.6 
D.7 

D.8 
D.9 



K \J£ Tk P* ~ E nj = fOT n = 1 • • • N 

K (j2 Tk Pk ~ E n \ = forn = 1 . . . N (D.10 

-pikPk =0iovk = l...N (D.ll 
-&p£ = 0for k = 1...N (D.12 

where equation (|D.9|) accounts for the additional constraint given by (|lip . 
and {7r n } denote the corresponding set of Lagrange multipliers. Since the 
additional constraint does not apply to the BO sensor, the partial derivative 

is identical to that in Appendix B, namely, = On the contrary, 

differs and, more specifically, it reads 

7T77 = ~Tfc ; -, i \ +Tfc(> A n . -7T n )-/X fc (D.13) 



Pk ( 1 + UPk+^Pk) 2 



n=k 



From (]D.2p and by introducing the change of variables = X^n=fc(An — 
VTn) — 7^ and = Yln=k Vn — ^r, the optimal transmit powers in fc-th epoch, 
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p^ and p^ , again yield 

~h _ Bk(A k + B k - A k B k ) (Til A) 

Pk ~ A k {A k + B k f 

_ A k (A k + B k - A k B k ) 
Pk ~ B k {A k + B k ) 2 
Equation (|D.6|) and the complementary slackness conditions (jD.lip and 

()D,12p again force the corresponding Lagrangian multipliers to vanish, i.e. 
Mfci^fc = 0. As in Appendix B, the transmit power changes (p^ ^ P k +i) 
iff A k 7^ A k+ i or, equivalently, if A/% — n k ^ 0. This is only possible 
for the following combinations of values of the Lagrangian multiplier: (i) 
Afc / 0, 7T fc = 0; (ii) X k = 0,7f& / 0; or (hi) X k / 0,Ti k Q Ak ¥= %k- The 
conditions (i) and (ii) accounts for cases in which the EC curve hits the 
cEH or cES curves at s k respectively; whereas (hi) accounts for the case in 
which the cEH and cES curves coincide at time instant s k (i.e. when energy 
harvested at s k equals battery capacity, namely, E\ = E m3iX ). 
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Figure 1: Network with energy harvesting and battery operated sensors (left); cumulative 
energy constraints and energy consumption curves (right). 
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Figure 3: Transmit powers are strictly positive (Lemma [5]) 
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Figure 4: The optimal energy consumption curve for the EH sensor is given by the shortest 
string below the cumulative EH curve (Theorem [TJ 
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Figure 7: Optimal transmission policy for the EH sensor with infinite (top) and finite 
(bottom) battery capacity. 
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Figure 8: Typical cumulative energy curves parameterized by c. 
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Figure 12: Throughput ratio (loss) as a function of battery capacity (N = 2250 
epochs). Big round markers on the curve correspond to the operating points where 
the maximum storage capacity -E max equals the energy harvested in each arrival E k 
(El = 2.21 ■ 10" 3 , 2.21 ■ 10" 2 , 2.21 ■ 10 _1 for the E T = 10, 100, 1000 J curves, respectively). 
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Figure 13: Battery overflow and early depletion phenomena: transmission policies for the 
EH sensor with nominal battery capacity (top, = 5 J) and actual capacity (bottom, 

^max = 2.5 J) for a given realization of energy arrivals. Dotted arrows indicate the 
arrivals in which part of the energy is wasted (E k = 2 J). As a reference, the lower plot 
includes the cEH curve for the nominal capacity (dash-dotted line). 
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Figure 15: Transmit power in each sensor remains constant between consecutive events 
(Lemma [T}. 
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Figure 16: When transmit power changes, the energy consumed by the EH sensor equals 
the energy harvested (Lemma |6}. 
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Figure 17: Transmission policies with battery overflows are strictly suboptimal. 
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